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m (54) Title: DIRECTORY FOR MULTI-PAGE SVG DOCUMENT 

(57) Abstract: An XML-based document which includes text for plural pages also includes directory information which stores 
location pointers for each of the plural pages. Preferably, the XML-based document is a document that conforms to the scalable 
)^ vector graphics (SVG) standard. An authoring tool and a viewer are disclosed. 
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DIRECTORY FOR MULTI-PAGE SVG DOCUMENT 

BACKGROUND OF THE INVENTION 

Field Of The Invention : 

The present invention relates to multi-page SVG-formatted 
documents (where "SVG" refers to Scalable Vector Graphics), and particularly 
relates to new element types that define a document directory including 
location information of each page. The invention also relates to authoring 
tools for such multi-page documents, as well as to viewers for viewing such 
documents. 

Description Of The Related Art : 

Scalable vector graphics ("SVG") is a language for describing 
two-dimensional graphics in extensible markup language ("XML"). SVG 
allows for three types of graphic objects: vector graphic shapes, images, and 
text. An SVG document is a plain-text document which includes plain-text 
descriptions of the graphics objects and their attributes, such as text which 
describes where the object is located (a "URI") or how to draw the object. The 
text is delimited by an opening tag and an accompanying closing tag which are 
collectively known as "element types". As an example, a "title" element type 
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might delimit text with an opening tag of the form <title> and a closing tag of 
</title>, although some element types might include opening and closing tags 
in a single line. 

The SVG language is maintained and defined by the World 
5 Wide Web Consortium (W3C), and details on the current grammar for SVG 

can be found at W3C's web site www.w3c.org. The current definition of SVG 
is defined by W3C in "Scalable Vector Graphics (SVG) 1.0 specification: 
W3C Recommendation 04 September 2001" available at the aforementioned 
web site and incorporated herein by reference. The specification includes the 

10 document type definition for SVG. A "document type definition" defines the 

rules and grammar to which the plural element types conform. 

As conceived by the W3C, SVG defines a graphical object 
viewable in a graphic window inside a document. More recently, however, it 
has been considered to extend the concept of SVG to that of a document itself, 

1 5 including multi-page documents. 

One such example has been proposed by the assignee of the 
present invention at Application No. 09/661,387, filed September 13, 2000, 
entitled "A Scalable Vector Graphics Print Driver". According to this 
application, the contents of which are incorporated herein by reference, a print 

20 driver accepts print output from an application program and, rather than 

generating information for a printer, instead generates an SVG document. The 
document may include multiple pages. The resulting SVG document may 
thereafter be used for any suitable purpose, such as posting on the Internet or 
inclusion in web pages. Moreover, because the SVG document is plain text 

25 and the SVG grammar is platform-independent, the SVG document can be 

transferred to users across computing platforms and thereby serves as a 
platform-independent document format. 

Through use of the print driver defined in the aforementioned 
Application No. 09/661,387, any application program that supports printing 

30 can output an SVG-formatted document thereby achieving platform 

independence for the output of any application that prints. 
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Another effort in developing print drivers that output S VG- 
formatted documents is provided by S VGmaker™ developed by Software 
Mechanics Party Ltd. of Brisbane, Australia, for which information is 
provided in its web site at www . s v gm aker . com . 
5 One problem recently encountered with multi-page SVG- 

formatted documents involves rendering and viewing such documents. 
Because SVG was originally conceived as a format for a graphic in a single 
window, to render the document in preparation for viewing it is necessary to 
parse the entire document, from beginning to end. For multi-page documents, 
10 however, parsing the entire document is time-consuming, particularly if the 

document includes many pages. Moreover, because the document cannot be 
completely rendered or viewed until parsing is completed, large delays are 
encountered even when viewing just one page of the document, or in moving 
from page to page. 



15 



SUMMARY OF THE INVENTION 



It is an object of the invention to provide element types that 
define a document directory structure, with the document directory structure 

20 including at least information concerning the location of individual pages in 

the SVG document. Because the location of individual pages in the SVG 
document can be determined, a viewer need only parse the locations needed to 
render a view of the current page. 

Thus, in one aspect, the invention involves an XML-based 

25 document such as SVG which includes text delimited by plural element types 

respectively defining at least vector graphic shapes, images and text, with each 
element type having an opening tag and a corresponding closing tag which 
collectively conform to a document type definition that defines rules for the 
plural element types. The XML-based document includes text for plural pages 

30 as well as page-information text, wherein the page-information text is 

delimited by page-information element types that define locations in the 
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document of the text for each of the plural pages. Preferably, the XML-based 
document further includes document-resource text delimited by document- 
resource element types that define resources (such as font definitions and the 
like) applicable across multiple pages or globally across the entire XML-based 
5 document. 

Because of the page-information text, a viewer rendering the 
XML-based document is not necessarily required to parse the entire XML- 
based document. Rather, it need only parse those locations in the document 
identified by the page-information text. Moreover, because the page- 

1 0 information text is delimited by page-information element types, the viewer 

can locate the page-information text quickly. 

Moreover, because of the document-resource text which is 
likewise delimited by document-resource element types, the viewer can 
quickly locate and process resources that are applicable across multiple pages 

15 or globally applicable. 

In particularly preferred embodiment, the XML-based 
documents includes a location pointer, at or near the beginning of the 
document, to directory-table text delimited by director-table element types that 
directly or indirectly define the location of the page-information element types. 

20 By virtue of these location pointers, a viewer parses the document even more 

quickly, since it is able to pinpoint exactly the locations within the document 
for which parsing is needed for any particular page. 

Preferably, the directory-table directly or indirectly includes 
information concerning the document-resource text, as well as information 

25 concerning thumbnail images for pages of the text and document attributes 

such as author and creation date. 

Further aspects of the invention involve an authoring tool such 
as a print driver for authoring XML-based documents according to the 
invention, as well as a viewing tool for parsing, rendering and viewing such 

30 XML-based documents. 
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This brief summary has been provided so that the nature of the 
invention may be understood quickly. A more complete understanding of the 
invention can be obtained by reference to the following detailed description of 
the preferred embodiment thereof in connection with the attached drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a view illustrating the outward appearance of 
computing equipment embodying the present invention. 

Figure 2 is a detailed block diagram of the computing 
equipment shown in Figure 1. 

Figure 3 is a flow diagram for explaining a document viewer 
according to the invention. 

Figure 4 is a view illustrating the display of multi-page 
documents by a document viewer. 

Figure 5 is a flow diagram for explaining a second embodiment 
of the invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

Figure 1 is a view showing the outward appearance of 
representative computing equipment 10 which includes image-based gamut 
mapping according to the invention. Computing equipment 1 0 includes host 
processor 1 1 which comprises a personal computer (hereinafter "PC") 
preferably having a windowing operating system such as Microsoft Windows, 
Xwindows or Macintosh operating systems. Provided with computing 
equipment 10 are color monitor 12 including display screen 14, keyboard 15 
for entering text data and user commands, and pointing device 16. Pointing 
device 16 preferably comprises a mouse, for pointing, selecting and 
manipulating objects displayed on display screen 14. 
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Computing equipment 10 includes a computer-readable 
memory medium such as a fixed disk 17 and/or floppy disk drive 19 and/or 
CDROM drive 20. Such computer readable memory media allow computing 
equipment 10 to access information such as image data, computer executable 
5 process steps, application programs, and the like, stored on removable and 

non-removable memory media. In addition, network access 21 allows 
computing equipment 10 to acquire information, images and application 
programs from other sources, such as a local area network or the internet, or 
from digital cameras or digital video cameras. 

10 Figure 2 is a detailed block diagram showing the internal 

architecture of PC 1 1. As shown in Figure 2, PC 1 1 includes central 
processing unit ("CPU") 25 that interfaces with computer bus 26. Also 
interfacing with computer bus 26 are fixed disk 17, network interface 27 for 
network access 21, random access memory ("RAM") 29 for use as main 

15 memory, read only memory ("ROM") 30, floppy disk interface 31, CDROM 

interface 32, display interface 34 to monitor 12, keyboard interface 36 to 
keyboard 15, mouse interface 37 to pointing device 16, and printer interface 39 
to printer 24. 

Main memory 29 interfaces with computer bus 26 so as to 
20 provide RAM storage to CPU 25 during execution of software programs such 

as the operating system, application programs, and device drivers. More 
specifically, CPU 25 loads computer-executable process steps from disk 17 or 
other memory media into a region of main memory 29, and thereafter executes 
the stored process steps from main memory 29 in order to execute software 
25 programs. 

As also shown in Figure 2, fixed disk 17 contains a window 
operating system 41, application programs 42 such as application programs 
that obtain, manipulate, and print multi-page documents, multi-page SVG 
documents 43, device drivers 44 such as printer driver 45, SVG authoring tool 
30 46 which authors multi-page SVG documents, and an SVG viewer 47 which 
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reads SVG data files including multi-page SVG files and displays them on 
monitor 12. 

Authoring of multi-page SVG documents according to the 
invention is preferably implemented as part of printer driver 45, but it may be 
5 implemented as a stand-alone software application program. It is also possible 

to implement the authoring tool as a dynamic link library ("DLL"), or as a 
plug-in to other application programs such as Word™ from Microsoft 
Corporation or QuarkXPress™ from Quark, Inc. Likewise, SVG viewer 47 is 
preferably implemented as a stand-alone software application program, but can 

10 be implemented as a DLL or a plug-in to another program such as Internet 

Explorer™ from Microsoft Corporation. 

Authoring tool 46 is arranged to output XML-based documents 
which include text delimited plural element types respectively defining at least 
vector graphic shapes, images, and text. Each element type has an opening tag 

1 5 and a corresponding closing tag which collectively conform to a document 

type definition that defines rules and grammar for the plural document types. 
Preferably, the XML-based document is an SVG-formatted document 
according to the W3C standard, in which case the document type definition 
(hereinafter "DTD") is that defined by the W3C and augmented by the DTD 

20 described below. 

The SVG-based document includes text for plural pages. The 
text need not be the text that actually appears in the rendered document, but 
rather might include text that signifies a web-based URI where the text might 
be located, or text that includes other SVG-based element types for defining 

25 aspects of each page of the document. Preferably, text for each page of the 

document is grouped together with other aspects of the page, but it is possible 
that the text of the page is disjointed and dispersed throughout the SVG 
document so long as the page location information (described below) properly 
identifies the location of all elements in any one page. 

30 In outputting text for the XML-based document, authoring tool 

46 also outputs text that defines resources used by one or more of the pages, or 
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used globally by all of the pages. Such resources include, for example, font 
definitions, shapes, clip regions and the like. 

Authoring tool 46 also outputs directory-exists text delimited 
by directory-exists element types that signify that the SVG document contains 
5 directory information which, in turn, directly or indirectly indicates location of 

each page in the multi-page document. Preferably, the directory-exists text is 
located at or near the beginning of the SVG document, so that a viewer parsing 
the SVG document will encounter it early and be able to locate and retrieve 
location information of each page of text without necessarily parsing the entire 

1 0 SVG document. Such directory information is preferably written by authoring 

tool 46 at the end of the SVG document, so as to simplify parsing by viewers. 

In the present invention, the directory-exists text points to a 
directory-table element which, in turn, indirectly provides location information 
for each page in the multi-page document. The reason for indirect information 

15 is to permit additional layers of abstraction in the overall directory structure, 

thereby achieving a more generalized and extensible directory description of 
the document. 

The directory-table element includes directory-table text 
delimited by directory-table element type, and merely includes an additional 

20 location pointer to table-list text. This level of indirection is desirable since it 

ensures that the directory-table text is always located at the same position 
(relative to the end of the SVG document) in all multi-page SVG documents 
according to the invention. 

The directory-table-list text includes text delimited by 

25 directory-table-list element types that define at least one directory-definition 

element followed by zero or more doc-resource elements. The doc-resource 
elements includes text specifying a location pointer for resources in the SVG 
document that are applicable across one or more of the pages such as font 
definitions, and the like. The doc-resources text is delimited by doc-resources 

30 element types. 
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The directory-definitions text is delimited by directory- 
definition element types and specifies the location of a directory-table- 
information element. The directory-information-element, in turn, includes text 
specifying the number of pages in the document as well as at least one page- 
5 directory element. The directory-information text further may include zero or 

more thumbnail directory elements and doc-attribute elements. 

The page-directory text element contains one page-information 
element for each page in the SVG document. Each page-information element, 
in turn, includes text that specifies the location in the SVG document of text 

10 corresponding to each page of the multi-page document. Preferably, each page 

of the SVG document is grouped together, such that only one page- 
information element is needed for each page, but it is possible to include more 
than one page-information element in a case where text for each page is 
dispersed throughout the multi-page document. 

1 5 Returning to the directory-information element, this element 

may also include zero or more thumbnail-directory elements and zero or more 
document-attribute elements. The thumbnail-information elements contain 
definitions for thumbnail graphics that represent each page in the multi-page 
document. The document-attributes element includes document attributes 

20 such as author and creation date. 



[Document Type Definition ("DTD")] 

The foregoing provides a brief description of the elements for 
the document directory. A complete copy of the preferred DTD follows and is 
25 used to augment the DTD provided by W3C for SVG documents: 



30 



<! ELEMENT docDirectoryExists EMPTY> 
<!ATTLIST docDirectoryExists 

offset %Integer; #REQUIRED> 



<! ELEMENT pageStart EMPTY> 
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<! ATTLIST pageStart> 

<! ELEMENT pageEnd EMPTY> 
<! ATTLIST pageEnd> 

5 

<! ELEMENT docDirectoryTable EMPTY> 

<! ATTLIST docDirectoryTable 

offset %Integer; #REQUIRED 
byteCount %Integer; #REQUIRED > 

10 

<! ELEMENT docDirectoryDefs EMPTY> 

<! ATTLIST docDirectoryDefs 

offset %Integer; #REQUIRED 
byteCount %Integer; #REQUIRED > 

15 

<! ELEMENT docResources EMPTY> 

<! ATTLIST docResources 

offset %Integer; #REQUTRED 
byteCount %Integer; #REQUTRED > 

20 

<! ELEMENT directoryTableList (docDirectoryDefs, docResources*)> 
<! ATTLIST directoryTableList> 

<! ELEMENT docPageDir (docPageInfo+) > 
25 <! ATTLIST docPageDir> 

<! ELEMENT docThumbnailDir (docThumbnaillnfo*) > 
<! ATTLIST docThumbnailDir> 



30 



<! ELEMENT docPagelnfo EMPTY> 
<! ATTLIST docPagelnfo 



WO 03/048922 



PCT/US02/38840 



-11- 

pageNumber %Integer; #REQUIRED 
offset %Integer; #REQUIRED 
byteCount %Integer; #REQUIRED 
width %Length; #REQUIRED 
height %Length; #REQUIRED 
color (true | false) #IMPLIED > 

<! ELEMENT docThumbnaillnfo (image)> 
<!ATTLIST docThumbnaillnfo 

pageNumber %Integer; #REQUTRED > 

<! ELEMENT docAttributes EMPTY> 
<! ATTLIST docAttributes 



author 


CDATA 


#IMPLIED 


creationDate 


CDATA 


#IMPLIED 


modifiedDate 


CDATA 


#IMPLIED 


title 


CDATA 


#IMPLIED 


subject 


CDATA 


#IMPLIED 


lastSavedBy 


CDATA 


#IMPLIED 


revisionNumber 


CDATA 


#IMPLIED 


applicationName 


CDATA 


#IMPLIED 


companyName 


CDATA 


#IMPLIED 


lastViewedDate 


CDATA 


#IMPLIED 


lastViewedBy 


CDATA 


#IMPLIED 


keywords 


CDATA 


#IMPLIED 


originalFileName 


CDATA 


#IMPLIED > 



<! ELEMENT docDirectorylnfo (docPageDir, docThumbnailDir, 

docAttributes) > 

<! ATTLIST docDirectorylnfo 

pageCount %Integer; #REQUIRED > 
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[Definitions of Element Types] 

A more complete definition of the elements is provided below: 

5 [docDirectoryExists Element] 

The docDirectoryExists element indicates that the SVG 
document file contains directory information and specifies the location (byte 
offset from the end of the file) of the docDirectoryTable element. 

10 <!Element docDirectoryExists EMPTY> 

<!ATTLIST docDirectoryExists 

offset %Integer; #REQUIRED > 

Attribute definitions: 

1 5 offset = "<integer>": The byte offset from the end of the 

document file to the docDirectoryTable element. 

[docDirectoryTable Element] 

The docDirectoryTable element specifies the location (byte 
20 offset from the end of the file) and size (byte count) of the directoryTableList 

element. 

<!Element docDirectoryTable EMPTY> 
<!ATTLIST docDirectoryTable 
25 offset %Integer; #REQUIRED 

byteCount %Integer; #REQUIRED > 

Attribute definitions: 

offset = "<integer>": The byte offset from the end of the 
30 document file to the directoryTableList element. The value string consists of 

four digits left padded with zeros. 
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byteCount = "<integer>": The length of the directoryTableList 
element in bytes. The value string consists of four digits left padded with 
zeros. 

5 [directoryTableList Element] 

The directoryTableList element contains one docDirectoryDefs 
element followed by zero or more docResources elements. 

<! Element directoryTableList (docDirectoryDefs, docResources*)> 
10 <!ATTLIST directoryTableList> 

Attribute definitions: 
None 



15 [docDirectoryDefs Element] 

The docDirectoryDefs element specifies the location (byte 
offset from the beginning of the file) and size (byte count) of the 
directoryTablelnfo element. 

20 <!Element docDirectoryDefs EMPTY> 

<!ATTLIST docDirectoryDefs 

offset %Integer; #REQUIRED 
byteCount %Integer; #REQUTRED > 

25 Attribute definitions: 

offset = "<integer>": The byte offset from the beginning of the 
document file to the docDirectorylnfo element. 

byteCount = "<integer>": The length of the 
directoryDefinitions element in bytes. 



30 
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[docResources Element] 

The docResources element specifies the location (byte offset 
from the beginning of the file) and size (byte count) of SVG definitions 
(embedded font definitions, defs elements, etc.) reference on multiple pages. 

5 

<! Element docResources EMPTY> 
<! ATTLIST docResources 

offset %Integer; #REQU1RED 

byteCount %Integer; #REQUIRED > 

10 

Attribute definitions: 

offset = "<integer>": The byte offset from the beginning of the 
document file to the start of the shared resource definitions. 

byteCount = "<integer>": The length of the shared resource 
1 5 definitions in bytes. 



[docDirectorylnfo Element] 

The docDirectorylnfo element specifies the number of pages in 
the SVG document and contains the required docPageDir element. This 
20 element also may contain the optional docThumbnailDir and docAttributes 

elements. 



<!Element docDirectorylnfo (docPageDir, docThumbnailDir?, 
docAttributes?)> 
25 <! ATTLIST docDirectorylnfo 

pageCount %Integer; #REQUIRED > 



30 



Attribute definitions: 

pageCount 

document. 



= "<integer>": The number of pages in the SVG 
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[docAttributes Element] 

The docAttributes element specifies the values of the SVG 
document information keys. 

5 <!Element docAttributes EMPTY> 

<! ATTLIST docAttributes 



author 


/~^T\ ATA 

CDAl A 


ffilVLrLJiiL) 


creationDate 




JITTUTIT TT7n 

#lMFLLhL) 


modifiedDate 


CDATA 


#IMPLIED 


title 


CDATA 


#IMPLffiD 


subject 


CDATA 


#IMPLIED 


lastSavedBy 


CDATA 


#IMPLIED 


revisionNumber 


CDATA 


#IMPLffiD 


applicationName 


CDATA 


#IMPLffiD 


companyName 


CDATA 


#IMPLIED 


lastViewedDate 


CDATA 


#IMPLIED 


lastViewedBy 


CDATA 


#IMPLffiD 


keywords 


CDATA 


#IMPLIED 


originalFileName 


CDATA 


#IMPLffiD > 



20 

Attribute definitions: 

author = "<cdata>": Document author's name. 

creationDate = "<cdata>": Date the document was created 
using this format "year/month/day hounminutes: seconds". 
25 modifiedDate = "<cdata>": Date the document was last 

modified. 

title = "<cdata>": Title of the document, 
subject = "<cdata>": Subject of the document. 
lastSavedBy = "<cdata>": Name of individual that last 
30 modified the document. 

revisionNumber = "<cdata>": Document revision number. 
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applicationName = "<cdata>": Name of application which 
created the SVG document. 

companyName = "<cdata>": Name of company which created 

the document. 

5 lastViewedDate = "<cdata>": Date the document was last 

viewed. 

lastViewedBy = "<cdata>": Name of individual that last 
viewed the document. 

keywords = "<cdata>": Document search keywords. 
10 originalFileName = "<cdata>": Original file name of the 

document. 

[docPageDir Element] 

The docPageDir element contains one docPagelnfo element for 
1 5 each page in the SVG document. 

<!Element docPageDir (docPageInfo+) > 
<!ATTLIST docPageDir> 

20 Attribute definitions: 

None. 

[docThumbnailDir Element] 

The docThumbnailDir element contains one docThumbnaillnfo 
25 element for each page thumbnail present in the SVG document. 

<! Element docThumbnailDir (docThumbnaillnfo*) > 
<! ATTLIST docThumbnailDir> 



30 



Attribute definitions: 
None. 
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[docPagelnfo Element] 

The docPagelnfo element specifies the page number, location 
(byte offset from the beginning of the file) and size (byte count) of a page in 
the SVG document. The optional color attribute specifies whether the page 
5 uses colors other then white, black, or gray (if not specified, default is false). 



<!Element docPagelnfo EMPTY> 
<!ATTLIST docPagelnfo 

pageNumber %Integer; #REQUIRED 
1 0 offset %Integer; #REQUIRED 

byteCount %Integer; #REQUIRED 

width %Length; #REQUIRED 

height %Length; #REQUIRED 

color (true | false) #IMPLIED > 

15 

Attribute definitions: 

pageNumber = "<integer>": The page number this element 
describes. The first page is 1 . 

offset = "<integer>": The byte offset from the beginning of the 
20 document file to the pageStart element. 

byteCount = "<integer>": The length of the page definition 
elements in bytes. 

width = "<length>": The width of the page in real number 
format optionally followed by a standard unit identifier such as pt, cm, or in. 
25 height = "<length >": The height of the page in real number 

format optionally followed by a standard unit identifier such as pt, cm, or in. 

color = "<true | false>": Indicates whether the page uses colors 
other then white, black, or gray. 



30 
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[docThumbnaillnfo Element] 

The docThumbnaillnfo element contains the definition of the 
graphics for a page thumbnail. The required pageNumber attribute specifies 
the page number of the SVG document page the thumbnail represents. 

5 

<!Element docThumbnaillnfo (image)> 
<! ATTLIST docThumbnaillnfo 

pageNumber %Integer; #REQUIRED > 

1 0 Attribute definitions : 

pageNumber = "<integer>": The page number this element 
defines the thumbnail for. The first page is 1 . 

[An Example] 

15 A simple example of an SVG-based document with plural 

pages is provided below. There are exactly two pages, the first page consisting 
only of the text "Page one" and the second page consisting only of the text 
"Page two". The SVG-based document includes global resources in the form 
of a clip path, and further includes directory information according to the 

20 invention. 

The example also highlights that the SVG-based document 
need not include an explicit reference to the DTD with which the element 
types conform. Rather, it is sufficient for the element types to conform to the 
DTD, even if a reference to the DTD is not given explicitly: 

25 

<?xml version="1.0" standalone="no"?> 
<!-- Canon SVG Driver Copyright (C) 2001 Canon Inc. --> 
<svg width-"612pt" height="792pt"> 
<docDirectoryExists offset="597> 
30 <defs> <!— SVG: commonly used clippath ~ > 

<clipPath id="clipPath0"> 
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<rect x="21.3" y="482.4" width="18.25" height="7.57"/> 

</clipPath> 

</defs> 

<text x="15.0" y="16.6">Page one</text> 

5 

<text x="15.0" y="16.6">Page two</text> 

<docDirectoryInfo pageCount="2"> 
<docPageDir> 

1 0 <docPageInfo pageNumber=" 1 " offset="308" 

byteCount=" 1235" width="6 1 2.000pt" 
height="792.000pt'7> 
<docPageInfo pageNumber="2" offset="1543" 

byteCount="2357" width="8.5in" height="llin" 
15 color="true"/> 

</docPageDir> 
<docThumbnailDir> 

<docThumbnailInfo pageNumber=" 1 "> 

<image x="0" y="0" width="18" height="23" 

20 xlink:href="data:image/png;base64, " /> 

</docThumbnailInfo> 
</docThumbnailDir> 
<docAttributes author="Loren Wood" 

creationDate="2001/l 1/28 17:05:43" 
25 title="SVG Document Directory Element Definitions" 

revisionNumber="0. 1 0" 

./> 

</docDirectoryInfo> 
30 <directoryTableList> 

<docDirectoryDefs offset="3900" byteCount="423'7> 
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<docResources offset="163" byteCount="1457> 
</directoryTableList> 

<docDirectoryTable offset="0204" byteCount="0145'7> 

</svg> 

5 

[SVG Viewer 47] 

Figure 3 is a flow diagram illustrating operation of SVG viewer 
47 in parsing, rendering and displaying SVG documents including multi-page 
SVG documents having directory information according to the invention. 

10 Briefly, according to Figure 3, SVG viewer 47 determines 

whether document directory information for a multi-page SVG document 
exists, and if such information exists, SVG viewer 47 obtains location 
information corresponding to the text of each page in the multi-page 
document SVG viewer 47 also obtains location information for document 

15 resources used by one or more of the pages, or used globally by the entire 

document. Based on this information and on the current page being viewed, 
SVG viewer 47 parses text for the current page and renders the current page 
using any document resources needed for the current page. The rendered page 
is thereafter displayed to the user on monitor 12, together with any thumbnail 

20 or document attribute information that might be included in the document 

directory. 

In more detail, in step S301 SVG viewer 47 parses for a 
directory-exists element. If no such element is found (step S302), then flow 
branches to step S304 in which viewer 47 parses the entire SVG document, 

25 renders it (step S305), and displays it (step S306), all in accordance with 

known techniques for parsing, rendering and displaying SVG documents that 
do not include multi-page information. 

If a directory-exists element is found, flow continues with step 
S308 in which viewer 47 jumps to the directory-table element and, based on 

30 the location pointer in the directory-table element, thereafter jumps to the 

directory-table-list element (step S309). If the directory-table-list element 
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contains a document-resources element (step S3 10), then flow branches to step 
S3 1 1 in which viewer 47 parses document resources at the locations pointed 
by the text in the document-resources element. In either event, flow thereafter 
continues in step S3 13 in which viewer 47 jumps to the directory-information 
5 element based on the location pointer contained in the directory-definition 

element of the directory-table- list element. 

In step S3 14, viewer 47 obtains the page count from the 
directory-information element. For the current page, execution of step S3 15 
causes viewer 47 to obtain page location information from a corresponding 

10 page-information element or elements, depending on whether information for 

each page is grouped together within the multi-page SVG document. 

In step S3 16, viewer 47 parses the SVG document for text at 
the location for the current page. Ordinarily, viewer 47 will parse the SVG 
document only for the current page, to the exclusion of other pages, such that 

1 5 the parsing and rendering process is faster than otherwise would have been 

performed according to steps S304 through S306 in cases where the document 
directory does not exist. 

If a thumbnail-directory element exists (step S3 17), then viewer 
47 obtains corresponding thumbnail information (step S3 18). Likewise, if the 

20 document-attribute element exists (step S320), then viewer 47 obtains 

corresponding document attribute information (step S321). 

In step S323, viewer 47 renders the current page (or pages) 
using any needed document resources obtained in step S3 1 1 . Thereafter, in 
step S325, viewer 47 displays the current page with thumbnail and document 

25 attributes, if such exist. If a request is received to display a new page (step 

S326), flow returns to step S3 15 to repeat the process for the newly-requested 
page. 

Figure 4 is a view illustrating the output of viewer 47 upon 
processing of the S VG-based document provided in the example above. As 
30 shown in Figure 4, viewer 47 provides a display window 81 on monitor 12 

which includes a page display area 82, a thumbnail display area 83, and a 
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attribute area 84. The page display area 82 includes a full rendering of one or 
more current pages in accordance with steps S323 and S325. The thumbnail 
area 83 displays any thumbnails displayed in steps S3 17 and S3 18, whereas 
any document attributes obtained in steps S320 and S321 are displayed in 
5 attribute area 84. 

For the currently displayed page corresponding to steps S3 15 to 
S325 (which, in this drawing is "Page 2"), the thumbnail image is visually 
highlighted at 85 so as to provide navigational feedback to the user on the 
location of the current page relative to the overall document. 

10 Figure 5 is a flow diagram illustrating operation according to a 

second embodiment, which features progressive display of pages in page 
display area 82. In Figure 5, steps S501 through S514 are generally similar to 
corresponding steps S301 through S3 14 in Figure 3, and the description 
thereof is omitted for the sake of brevity. 

15 In step S5 1 5, the viewer obtains information for the currently- 

selected page only. In particular, the viewer obtains page location information 
from a corresponding page-information element or elements, depending on 
whether information for each page is grouped together within the multi-page 
SVG document. 

20 In step S5 1 6, the viewer parses the SVG document for text at 

the location of the current page. In view of the progressive nature of this 
embodiment, the viewer parses the SVG document only for the current page, 
to the exclusion of other pages, and particularly to the exclusion of pages 
adjacent the current page. 

25 If a thumbnail-directory element exists (step S517) then for the 

current page the viewer obtains corresponding thumbnail information (step 
S518). Likewise, if the document-attribute element exists for the current page 
(step S520), then the viewer obtains corresponding document attribute 
information (step S512). In step S523, the viewer renders the current page 

30 only, using any needed document resources obtained thus far. Thereafter, in 

step S525, the viewer displays the current page together with thumbnail and 
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document attributes. The current page is displayed in display area 82. Pages 
adjacent the current page are not yet displayed (that is, are not displayed as of 
the execution of step S525), but preferably placeholder images (such as ghost 
outlines) are displayed for pages adjacent the current page. The placeholder 
5 images are displayed in display area 82. 

Since only placeholder images are displayed for pages adjacent 
the current page, the current page is displayed more quickly and in a 
progressive manner, thereby speeding the display of the currently-selected 
page. Thereafter, pages adjacent the current page are rendered (step S526) and 

10 displayed (step S527). This provides a progressive display in which the 

current page is displayed first, followed by display of pages adjacent the 
current page. This progressive display enhances the speed with which the 
current page is displayed, yet maintains navigational cues for the user to 
advance to different pages. 

15 In step S528, if a request is received to display a new page as 

the current page, then flow returns to step S515 to repeat the process for the 
newly-requested current page. Any already-rendered pages are reused in this 
display. For example, if the user selects an adjacent page as the new current 
page, then the former current page is displayed without re-rendering it. 

20 The invention has been described with respect to particular 

illustrative embodiments. It is to be understood that the invention is not 
limited to the above-described embodiments and that various changes and 
modifications may be made by those of ordinary skill in the art without 
departing from the spirit and scope of the invention. 

25 



WO 03/048922 



PCT/US02/38840 



-24- 

WHAT IS CLAIMED IS : 

1 . In an XML-based document which includes text 
delimited by plural element types respectively defining at least vector graphic 

5 shapes, images and text, each element type having an opening tag and a 

corresponding closing tag which collectively conform to a document type 
definition that defines rules for the plural element types, the improvement 
comprising: 

the XML-based document includes text for plural pages as well 
10 as page-information text, wherein the page-information text is delimited by 

page-information element types that define location in the document for each 
of the plural pages. 

2. An XML-based document according to Claim 1, further 

1 5 comprising page-directory text which includes the page-information text, the 

page-directory text being delimited by page-directory element types which 
contain at least one page-information element for each of the plural pages. 

3. An XML-based document according to Claim 2, further 
20 comprising directory-information text which includes the page-directory text, 

the directory-information text being delimited by directory-information 
element types that specify the number of the plural pages and which contains 
the page-directory elements. 

25 4. An XML-based document according to Claim 3, wherein the 

directory-information text further includes thumbnail-information text that 
defines a graphic for each of the plural pages, the thumbnail-information text 
being delimited by thumbnail-information element types. 
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5. An XML-based document according to Claim 4, further 
comprising thumbnail-directory text which includes the thumbnail-information 
text and which is delimited by thumbnail-directory element types. 

5 6. An XML-based document according to Claim 3, further 

comprising document-attribute text delimited by document-attribute element 
types and that specify attributes of the document. 

7. An XML-based document according to Claim 1, further 
10 comprising document-resource text that specifies a location pointer in the 

XML-based document for resources used by pages in the XML-based 
document. 

8. An XML-based document according to Claim 8, further 

1 5 comprising a directory-table-list element that includes the document-resource 

text and at least an indirect reference to the page-information text. 

9. An XML-based document according to Claim 8, further 
comprising document-directory-table text delimited by document-directory- 

20 table types and that provides location information of the directory-table-list 

text. 

10. An XML-based document according to Claim 1, wherein 
the page-information text is included in document-directory information stored 

25 in the XML-based document. 

11. An XML-based document according to Claim 10, wherein 
the document-directory information is stored at or near the end of the XML- 
based document. 

30 
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12. An XML-based document according to Claim 11, further 
comprising a directory-exists element delimited by directory-exists element 
types, the directory-exists elements directly or indirectly defining the location 
of the document-directory information, the directory-exists element being 

5 located at or near the beginning the XML-based document. 

13. An XML-based document according to Claim 1, wherein 
the XML-based document is comprised of an SVG-based document. 

10 14. An authoring tool for an XML-based document, wherein 

the XML-based document includes text delimited by plural element types 
respectively defining at least vector graphic shapes, images and text, each 
element type having an opening tag and a corresponding closing tag which 
collectively conform to a document type definition that defines rules for the 
15 plural element types, 

wherein the authoring tool comprises: 

means for creating text for plural pages in the XML-based 

document; and 

means for creating page-information text, wherein the page- 
20 information text is delimited by page-information element types that define 

location in the document for each of the plural pages. 

15. An authoring tool for an XML-based document according 
to any one of Claims 1 to 13. 



25 



16. An XML-based document according to Claim 15, wherein 
the authoring tool is comprised of a print driver. 



30 



17. An XML-based document according to Claim 15, wherein 
the authoring tool is comprised of a stand-alone application. 
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18. An XML-based document according to Claim 15, wherein 
the authoring tool is a plug-in to another application program. 

19. A viewer for an XML-based document, wherein the XML- 
5 based document includes text delimited by plural element types respectively 

defining at least vector graphic shapes, images and text, each element type 
having an opening tag and a corresponding closing tag which collectively 
conform to a document type definition that defines rules for the plural element 
types, 

10 wherein the viewer comprises: 

means for receiving a user selection of a current page; 
means for parsing text in the XML-based document based on 

the selected current page and based on the page-information text for the 

current page; and 

15 means for rendering an image of the current page based on the 

parsed text. 

20. A viewer for an XML-based document according to Claim 
19, wherein said means for rendering progressively renders the image of the 

20 current page and images of pages adjacent the current page. 

21 . A viewer for an XML-based document according to any 
one of Claims 1 to 13, wherein the viewer parses the XML-based document 
based on a current page being viewed and based on the page-information text, 

25 so as to render and display the current page. 

22. A viewer for an XML-based document according to Claim 
21, wherein the viewer is comprised of a stand-alone application. 



30 



23. A viewer for an XML-based document according to Claim 
21, wherein the viewer is a plug-in to another application program. 
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24. A viewer for an XML-based document according to Claim 
21, wherein the current page and pages adjacent the current page are displayed 
progressively. 
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